ST-TCSC-2002-02 Huffman Coding with Unequal Letter Costs [Extended

Authors

  • Mordecai J. Golin
  • Claire Kenyon
  • Neal E. Young
Abstract

In the standard Huffman coding problem, one is given a set of words and, for each word, a positive frequency. The goal is to encode each word w as a codeword c(w) over a given alphabet. The encoding must be prefix-free (no codeword is a prefix of any other) and should minimize the weighted average codeword size ∑_w freq(w) |c(w)|. The problem has a well-known polynomial-time algorithm due to Huffman [15]. Here we consider the generalization in which the letters of the encoding alphabet may have non-uniform lengths. The goal is to minimize the weighted average codeword length ∑_w freq(w) cost(c(w)), where cost(s) is the sum of the (possibly non-uniform) lengths of the letters in s. Despite much previous work, the problem is not known to be NP-hard, nor was it previously known to have a polynomial-time approximation algorithm. Here we describe a polynomial-time approximation scheme (PTAS) for the problem.
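The objective in the abstract can be made concrete with a minimal sketch. The function names, the two-letter alphabet, and the sample frequencies below are illustrative assumptions, not taken from the paper; the code only evaluates a given code and does not construct an optimal one.

```python
def is_prefix_free(codewords):
    """Return True if no codeword is a prefix of another.

    After sorting, any codeword that is a prefix of some other codeword
    is also a prefix of its immediate successor, so checking adjacent
    pairs is enough.
    """
    ordered = sorted(codewords)
    return all(not b.startswith(a) for a, b in zip(ordered, ordered[1:]))


def weighted_cost(freq, code, letter_cost):
    """Sum over words w of freq(w) * cost(c(w)).

    cost(s) is the sum of the costs of the letters in s.
    """
    def cost(s):
        return sum(letter_cost[ch] for ch in s)

    return sum(freq[w] * cost(code[w]) for w in freq)


# Hypothetical instance: letter "a" costs 1 and letter "b" costs 2.
letter_cost = {"a": 1, "b": 2}
freq = {"x": 5, "y": 3, "z": 2}
code = {"x": "a", "y": "ba", "z": "bb"}

print(is_prefix_free(code.values()))           # True
print(weighted_cost(freq, code, letter_cost))  # 5*1 + 3*3 + 2*4 = 22
```

With uniform letter costs (every letter costing 1) the same function reduces to the standard Huffman objective ∑_w freq(w) |c(w)|.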


Similar resources

SC-2000-05 Optimal Prefix-Free Codes for Unequal Letter Costs: Dynamic Programming with the Monge Property. Proposed Running Head: Prefix Codes for Unequal

In this paper we discuss the problem of finding optimal prefix-free codes for unequal letter costs, a variation of the classical Huffman coding problem. Our problem consists of finding a minimal cost prefix-free code in which the encoding alphabet consists of unequal cost (length) letters, with lengths α and β. The most efficient algorithm known previously requires O(n^{2+max(α,β)}) time to construct such a m...


Optimal Prefix-Free Codes for Unequal Letter Costs: Dynamic Programming with the Monge Property

In this paper we discuss a variation of the classical Huffman coding problem: finding optimal prefix-free codes for unequal letter costs. Our problem consists of finding a minimal cost prefix-free code in which the encoding alphabet consists of unequal cost (length) letters, with lengths α and β. The most efficient algorithm known previously required O(n^{2+max(α,β)}) time to construct such a minimal-cost s...
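To illustrate the two-letter problem stated above, here is a minimal brute-force sketch. It is an exhaustive search over all binary code trees, not the dynamic program with the Monge property that the paper describes, and it is only practical for a handful of words. The function name `min_cost_code` and the sample inputs are assumptions made for illustration.

```python
from functools import lru_cache


def min_cost_code(freqs, alpha, beta):
    """Minimum of sum_w freq(w) * cost(c(w)) over prefix-free codes on a
    two-letter alphabet whose letters cost alpha and beta.

    Exhaustive search: a code tree for a set of words either is a single
    leaf, or splits the set into a part reached by the alpha-letter and a
    part reached by the beta-letter. Runs in exponential time.
    """
    n = len(freqs)
    # weight[mask] = total frequency of the words selected by the bitmask.
    weight = [0] * (1 << n)
    for mask in range(1, 1 << n):
        low = mask & -mask
        weight[mask] = weight[mask ^ low] + freqs[low.bit_length() - 1]

    @lru_cache(maxsize=None)
    def best(mask):
        if mask & (mask - 1) == 0:  # zero or one word: a leaf, no further cost
            return 0
        result = None
        sub = (mask - 1) & mask     # enumerate proper non-empty subsets
        while sub:
            other = mask ^ sub
            cost = (alpha * weight[sub] + beta * weight[other]
                    + best(sub) + best(other))
            if result is None or cost < result:
                result = cost
            sub = (sub - 1) & mask
        return result

    return best((1 << n) - 1)


print(min_cost_code([5, 3, 2], 1, 2))  # 22
print(min_cost_code([5, 3, 2], 1, 1))  # 15, the ordinary Huffman cost
```

Restricting the search to trees in which every internal node has two children loses nothing when both letter costs are positive, since an internal node with a single child can be contracted without increasing the cost.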


arXiv:adap-org/9303001v1, 5 Mar 1993. The length of a typical Huffman codeword. Rüdiger Schack

If $p_i$ ($i = 1, \ldots, N$) is the probability of the $i$-th letter of a memoryless source, the length $l_i$ of the corresponding binary Huffman codeword can be very different from the value $-\log p_i$. For a typical letter, however, $l_i \approx -\log p_i$. More precisely, $P_m^- = \sum_{j \in \{i \mid l_i < -\log p_i - m\}} p_j < 2^{-m}$ and $P_m^+ = \sum_{j \in \{i \mid l_i > -\log p_i + m\}} p_j < 2^{-c(m-2)+2}$
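The comparison between $l_i$ and $-\log p_i$ can be tried on a small source with the sketch below. It assumes base-2 logarithms, and the helper name `huffman_lengths` and the sample distribution are illustrative choices, not taken from the paper.

```python
import heapq
import math


def huffman_lengths(probs):
    """Binary Huffman codeword lengths l_i for the probabilities p_i.

    Each heap entry holds (probability, tie-breaker, symbol indices).
    Merging two entries pushes every symbol in them one level deeper.
    """
    heap = [(p, i, [i]) for i, p in enumerate(probs)]
    heapq.heapify(heap)
    lengths = [0] * len(probs)
    counter = len(probs)  # unique tie-breaker so the lists are never compared
    while len(heap) > 1:
        p1, _, s1 = heapq.heappop(heap)
        p2, _, s2 = heapq.heappop(heap)
        for i in s1 + s2:
            lengths[i] += 1
        heapq.heappush(heap, (p1 + p2, counter, s1 + s2))
        counter += 1
    return lengths


probs = [0.5, 0.25, 0.125, 0.125]
for p, l in zip(probs, huffman_lengths(probs)):
    print(p, l, -math.log2(p))
```

For this dyadic distribution the lengths come out as 1, 2, 3, 3, matching $-\log_2 p_i$ exactly; for non-dyadic distributions the two values differ, which is the gap the bounds above quantify.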


Asymmetrical two-level scalar quantizer with extended Huffman coding for compression of Laplacian source

This paper proposes a novel model of the two-level scalar quantizer with extended Huffman coding. It is designed so that the average bit rate approaches the source entropy as closely as possible, provided that the signal to quantization noise ratio (SQNR) value does not decrease by more than 1 dB from the optimal SQNR value. Assuming the asymmetry of representation levels for the symmetric ...


Huffman Coding with Letter Costs: A Linear-Time Approximation Scheme

We give a polynomial-time approximation scheme for the generalization of Huffman coding in which codeword letters have nonuniform costs (as in Morse code, where the dash is twice as long as the dot). The algorithm computes a (1 + ε)-approximate solution in time O(n + f(ε) log n), where n is the input size.



Publication date: 2002